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ps) . Abstract 

We consider the triangle-free process: given an integer n, start by taking a uniformly random 
ordering of the edges of the complete n- vertex graph K n . Then, traverse the ordered edges and 
\& \ add each traversed edge to an (initially empty) evolving graph - unless its addition creates a 

triangle. We study the evolving graph at around the time where 0(n 3 / 2+e ) edges have been 
traversed for any fixed e £ (0, 10 -10 ). At that time and for any fixed triangle-free graph F, we 
give an asymptotically tight estimation of the expected number of copies of F in the evolving 
graph. For F that is balanced and have density smaller than 2 (e.g., for F that is a cycle 
of length at least 4), our argument also gives a tight concentration result for the number of 
copies of F in the evolving graph. Our analysis combines Spencer's original branching process 
approach for analysing the triangle-free process and the semi-random method. 
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In this paper we consider the triangle- free process. This is a random greedy process that generates 
a triangle-free graph as follows. Given n £ N, take a uniformly random ordering of the edges of 



the complete n-vertex graph K n . Here, we take that ordering as follows. Let (3 : K n — > [0, 1] 



be chosen uniformly at random; order the edges of K n according to their birthtimes /3(f) (which 
are all distinct with probability 1), starting with the edge whose birthtime is smallest. Given the 
ordering, traverse the ordered edges and add each traversed edge to an evolving (initially empty) 
triangle-free graph, unless the addition of the edge creates a triangle. When all edges of K n have 
been exhausted, the process ends. Denote by TF(n) the triangle-free graph which is the result of 
the above process. Further, denote by T¥(n,p) the intersection of TF(n) with {/ : (3{f) < p}. 

For a graph F, let Xp be the random variable that counts the number of copies of F in T¥(n,p). 
We use ep and vp to denote respectively the number of edges and vertices in a graph F and set 
aut(F) to be the number of automorphisms of F. A graph F is balanced if ep/vp > en/vn for 
all H C F with vh > 1- We say that an event holds asymptotically almost surely (a.a.s.) if the 
probability of the event goes to 1 as n — > oo. For mi = mi(n), = rn^in), we write m\ ~ m<i if 
m\jmi goes to 1 as n — > oo. Let Inn denote the natural logarithm of n. Our main result follows. 

Theorem 1.1. Fix a triangle-free graph F and e G (0, 1CP 10 ). For some p ~ n £ ~ 1 ^ 2 , 
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Our second result gives a concentration result for Xp, for certain fixed triangle- free graphs F. 

Theorem 1.2. Fix a balanced triangle-free graph F with epjvp < 2. Then there exists < £f < 
1CP 10 such that for all e £ (0,ep) the following holds. For some p ~ n 2-1 / 2 , a.a.s., 

^ Vp\ ( n \ /lnn e 

aut(F) \vf J \ n 

One interesting point worth making with respect to Theorem 11.21 is this. Let F be a balanced 
triangle-free graph with density ep/vp < 2. Fix e 6 (0, £p), where is as guaranteed to exist 
by Theorem 11.21 Let p ~ n e_1//2 be as guaranteed to exist by Theorem 11.21 Consider the random 
graph G(n, m), which is chosen uniformly at random from among those n- vertex graphs with 
exactly m := [2~ 1 n 3 ^ 2 \ / ln n e \ edges. Note that by Theorem 11.21 T¥(n,p) and G(n, m) a.a.s. has 
asymptotically the same number of edges. This of course follows directly from our choice of the 
parameter m. The point is that by standard techniques and by Theorem 11.21 we also have that 
a.a.s., the number of copies of F in G(n, m) is asymptotically equal to the number of copies of F in 
T¥(n,p). Furthermore, G(n,m) is expected to contain many triangles, and indeed it does contain 
many triangles a.a.s., whereas T¥(n,p) contains no triangles at all. Therefore, one may argue, 
at least with respect to the number of copies of fixed balanced triangle-free graphs with density 
strictly less than 2, that T¥(n,p) "looks like" a uniformly random graph with m edges-only that 
it has no triangles. A similar point can be made with respect to Theorem ll.il 




1.1 Related results 

Erdos, Suen and Winkler [5] were the first to consider the triangle-free process. They proved 
that the number of edges in TF(n) is a.a.s. bounded by Q(n 3 / 2 ) and 0(n 3//2 lnn). Spencer |12j 
showed that for every two reals a±,a2 > 0, there exists no such that the number of edges in T¥(n) for 
n > no is expected to be at least ain 3 / 2 and is a.a.s. at most a2n 3 / 2 Inn. In the same paper, Spencer 
conjectured that the number of edges in TF(n) is a.a.s. nn). In a recent breakthrough, 

this conjecture was proved valid by Bohman [2]. We remark that Theorem 11.21 generalizes Bohman's 
lower bound for the number of edges in TF(n) and answers a question of Spencer [13j . We discuss 
in some more details Bohman's result below. 

Other results are known for the more general H-hee process. In the H-iiee process, instead 
of forbidding a triangle, one forbids the appearance of a copy of H. Let M.(H, n) be the graph 
produced by the H-free process. There are several results with regard to the number of edges in 
M(H,n) mUnnilllllll]. For a graph H ^ K 3 that is strictly 2-balanced, the best lower bounds 
(which are probably optimal) on the number of edges in M.(H, n) are provided by Bohman and 
Keevash f3j; the best upper bounds on the number of edges in M.(H, n) are provided by Osthus and 
Taraz |10| and are within poly(lnn) factors from the best lower bounds. 

Lastly, in [2JEH12] , the authors consider the independence number of M.(H, n) for some graphs 
H. Most notable are the results of Bohman [2] and of Bohman and Keevash [3J. Bohman studies 
the independence number of M(iJ, n) for H £ {^3,^4}. His results imply Kim's [8] celebrated 
lower bound on the off-diagonal Ramsey number r(3, t) and a new lower bound for r(4, t). Bohman 
and Keevash extend Bohman's results for every H that is strictly 2-balanced. By that, they obtain 
new lower bounds for the off-diagonal Ramsey numbers r(s,t) for every fixed s > 5. 
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1.2 Comparison with Bohman's argument 

Bohman's analysis of the triangle-free process in [2] shows that the number of edges in TF(n) is 
a.a.s. f2(ra 3//2 \/lnn). Theorem 11.21 generalizes this result in that it matches Bohman's lower bound 
up to a constant and in addition provides an a.a.s. lower bound on the number of copies of F in 
TF(n), for every fixed F that is a balanced triangle- free graph with density less than 2. Moreover, 
Bohman's result implies a lower bound of f2(n 3//2 Vhin) on the expected number of edges in TF(n). 
Theorem 11.11 generalizes this result in that it matches Bohman's lower bound up to a constant and 
provides a lower bound on the expected number of copies of F in TF(n) for every fixed triangle-free 
graph F. Below we shortly discuss and compare Bohman's argument and ours. 

Bohman uses the differential equations method in order to analyse T¥(n,p) for p = n £ ~ 1 / 2 and 
some fixed e > 0. The basic argument can be described as follows. First, a collection of random 
variables that evolve throughout the random process is introduced and tracked throughout the 
evolution of T¥(n,p). This collection includes, for example, the random variable |Oj|, where Oi 
denotes the set of edges that have not yet been traversed by the process, and which can be added to 
the current graph without forming a triangle, after exactly i edges have been added to the evolving 
graph. Now, at certain times during the process (i.e., at those times in which new edges are added 
to the evolving graph), the expected change in the values of the random variables in the collection is 
expressed using the same set of random variables. This allows one to express the random variables 
in the collection using the solution to an autonomous system of ordinary differential equations. 
It is then shown that the random variables in the collection are tightly concentrated around the 
trajectory given by the solution to this system. The particular solution to the system then implies 
that \Oj\ is a.a.s. large for / = f2(re 3 / 2 \/ln n). This then implies the a.a.s. lower bound on the 
number of edges in TF(n). 

In comparison with the above, we analyse T¥(n,p) for p = n e-1 ' 2 and some fixed e > using the 
original branching process approach of Spencer [12] together with the semi-random method. These 
two are combined together using combinatorial arguments. Apart from our different approach for 
the analysis of the triangle-free process, our actual argument is more direct, in the sense that we 
estimate directly the probability that any fixed triangle-free graph F is included in T¥(n,p). Doing 
so allows us to infer the validity of our two main results using standard techniques. 

We remark that in the course of our analysis, we track and show the concentration of some 
random variables that in retrospect (and perhaps not surprisingly) turned out to be essentially the 
same random variables as some of those that were tracked by Bohman. We choose to keep this 
part of the proof both for the sake of completeness and since it provides an alternative argument 
for the concentration of these random variables. 

Lastly, we note that exactly like Bohman's argument, our ideas can be generalized so as to 
obtain results which are similar in spirit to our main theorems for the more general H-free process 
for a large family of graphs H. Moreover, since our arguments allow us to reason about subgraphs 
other than edges in the evolving graph, we can prove results of the following form: "a.a.s. every 
set of t vertices in M.(H, n) spans a copy of F v for some t and some fixed graphs F. In particular 
for H = K4, we can use the ideas presented in this paper in order to show that a.a.s. every set of 
t = 0(n 3 / 5 (lnn) 1//5 ) vertices in M(i^4,n) spans a triangle. This implies an a.a.s. upper bound on 
the number of edges in M(i^4,n) which matches up to a constant Bohman's lower bound. 
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2 Preliminaries 



2.1 Notation 

As usual, for a natural number a, let [a] := {1, 2, . . . , a}. We write x = a(y =t if it holds that 
x £ [a(y — z) b , a(y + z) ]. We also use a(y ± z) b to simply denote the interval [a(y — z) b , a(y + z) b \. 
All asymptotic notation in this paper is with respect to n — > oo. All inequalities in this paper are 
valid only for n > uq, for some sufficiently large no which we do not specify. 

2.2 Azuma's inequality 

The following result is a version of Azuma's inequality [6], tailored for combinatorial applications 
(see e.g. [TIE] ) ■ Let ai,«2; • • • be independent random variables with Oj taking values in a 
set Ai. Let ip : A\ x A<i x . . . x A m — > R satisfy the following Lipschitz condition: if two vectors 
a, a' £ 4] x ^2 x • • ■ x A m differ only in the ith coordinate, then \ip(ot) — ip{a')\ < Ci. Then the 
random variable X = ip(ai, ct2, • • • , a m ) satisfies for any t > 0, 



In this section we prove Theorems 11.11 and 11.21 modulo one technical result. We begin by giving 
an alternative definition of the triangle- free process. Under this alternative definition, we state a 
result (Theorem 13. ip which trivially implies Theorem I l.li We then use this result in order to prove 
Theorem 11.21 The rest of the paper will then be devoted for proving the above mentioned result. 

Fix once and for the rest of the paper e 6 (0, 10~ 10 ). Define 5 := 1/|_^ £ J and / := 5~ 2 . For every 
integer i > define a triangle- free graph TFj as follows. Initially, take TFo to be the empty graph 
over the vertex set of K n and set Bq := 0. Given TFj, define TFj + i as follows. Choose uniformly 
at random a function : K n \ B<i — > [0, 1] where B<i := Uj<j Bj- Let Bi + \ be the set of edges 
/ for which the birthtime /3j+i(/) satisfies /3j+i(/) < 5n~ l l 2 . Traverse the edges in Bi + \ in order of 
their birthtimes (starting with the edge whose birthtime is smallest), and add each traversed edge 
to TFj, unless its addition creates a triangle. Denote by TFj + i the graph thus produced. Observe 
that TF/ has the same distribution as TF(ra, j>) for some p ~ n e_1 / 2 . 

Let &(x) be a function over the reals, whose derivative is denoted by </>(x), and which is defined 
by 4>(x) := exp(— <I>(x) 2 ) and $(0) := 0. This is a separable differential equation whose solution (tak- 
ing into account the initial value) is given implicitly by ^erfi(<I>(x)) = x, where erfi(x) is the imag- 
inary error function, given by erfi(x) := ex.p(t 2 )dt. We have that erfi(x) — > exp(x 2 ) / '{^/ttx) 

as x — > oo. Hence, it follows that $(x) — ► Vlnx as x — > oo. 

By the discussion above, linearity of expectation and the fact that the number of copies of F 
in K n is au " t ^F) {y F ) the following result trivially implies Theorem I l.li 



Pt[\X -E[X] \>t}< 2exp 




3 Proof of Theorems [EI and LQ] 
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Theorem 3.1. Let F C K n be a triangle-free graph of size 0(1). Then 

Pr[F C TFj] ^ ' 1 ; 



For a graph F, let be the random variable that counts the number of copies of F in TFj. 
The following theorem clearly implies Theorem 11.21 

Theorem 3.2. Fix a balanced triangle-free graph F with epjvp < 2. Then there exists < £p < 
10 -10 such that for all e G (0, ep) the following holds. A.a.s., 

aut(F)\v F J\ y/H J 

Proof. Fix a balanced triangle-free graph F with cf/vf < 2. Assume e £ (0,£p) for some < 
ep < 10~ 10 sufficiently small so that it satisfies our arguments below. The number of copies of F 
in K n is ^m{v F )- Therefore, by Theorem EH 

[ Fl aut(F)VWV J ' 
To complete the proof, it suffices to show that Yp is concentrated around its mean. For that we 
use Chebyshev's inequality (see e.g. [1]). Thus it remains to show that Var(Yp) = o(E[Yp] 2 ). 

For G C K n , let Iq be the indicator random variable for the event {G C TFj}. We have 

Var(») = Cov(I G , I&) = ^ E[J G J^] - E[I G ] E[I G ,] , 
G,G' G,G' 

where the sum ranges over all copies G, G' of F in K n . We partition the sum above to two sums 
and show that each is bounded by o(E[Yp] 2 ). First, let ^ G G , be the sum over all copies G, G' of F 
in K n such that G and G' share no vertex. If G and G' share no vertex then GL)G' is triangle- free. 
Hence, since the number of two vertex-disjoint copies of F in K n is asymptotically equal to the 
number of copies of F in K n squared, it follows from Theorem 13.11 that 

Next, we will make use of the following observation: if G, G 1 are two copies of F in K n with 
GnG' being isomorphic to H, then E\IqIqi\ = 0((n £ ~ 1 / 2 ) 2eF ~ eH ). This is true since the event 
{G,G' C TF/} implies {GuG'C B</} and indeed, Pr[G UG'C B<j] = 0((n £ - 1 /2)2e J? -e H )_ Let 
be the sum over all H C F with wjj > 1. Let YliGr\G'=H De ^ ne sum over au c °pi es C, G' of F 
in X n that share at least 1 vertex such that G n G' is isomorphic to -ff. Then by the observation 
above, 

Y, E Cov(7 G! / G 0<O(n 2 ^-^)-(n £ - 1 / 2 )^-e„ ! 
-ff GnG'=H 

which, since F is a fixed balanced graph with cf/vf < 2, is at most o(E[Yp] 2 ) if e £ (Q,sf) and 
Ep is sufficiently small. This implies the desired bound on Var(Yp). ■ 

It remains to prove Theorem 13.11 In the following section we state two technical lemmas that 
will be used to prove Theorem 13. 11 The actual proof of Theorem 13. II is given in Section The rest 
of the paper will then be devoted for the proof of these technical lemmas. 
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4 Technical lemmas 



Here we state (and partly prove) two technical lemmas that will be used to prove Theorem 13.11 

We begin with some definitions. For every edge g G K n and for every < i < I, j € {0, 1, 2}, 
define Aj(g,i) as follows. Let Ao(<?, i) be the family of all sets {51,(72} ^ TFj such that {5,51,52} 
is a triangle. Let Ai(5,i) be the family of all singletons {51} Q K n \ B<i such that there exists 
52 S TFj for which {5,51,52} is a triangle and it holds that TFj U {51} is triangle- free. Let A2(5,i) 
be the family of all sets {51 , 52} C K n \ B<i such that {5, 51 , 52} is a triangle and for which it holds 
that TFj U {gj} is triangle- free for both j G {1,2}. 

Definition 1. For every < i < I, let 

7(i) := max{5$(i(5)0(i<5), 5 2 ^{i5) 2 }, 
T(i) := 



n" 30£ if i = 0, 

T(i - 1) • (1 + 107(3 - 1)) i/i>l. 



Our first technical lemma tracks the cardinalities of Aj(g,i). 
Lemma 4.1. Let < i < I. Suppose that given TFj, we have 



Vg e K n . 


|A (5,i)| 


< in 5e , 


Vg e K n . 


|Ai(s,*)| 


< iy/n, 


V5 i B<i. 


|Ai(5,i)l 


= 2Vn*(t5M*tf) • (1 ± r(»)), 


V5 i B<i. 


|A 2 (^,»)I 


= n0(i5) 2 • (l±r(t)). 


Then with probability 1 — n - ^ 1 -*, 






Mg£K n . \A (g,i 


+ 1)1 < 


(* + l)n 5e , 


Vg £ K n . \Ai(g,i 


+ 1)1 < 


(i + l)Vn, 


Vg<£B< i+1 . \At(g,i 


+ 1)1 = 


2V«*((t + + 1)5) • (1 ± r(i + 1)) 


Vg(£B< i+1 . \A 2 (g,i 


+ 1)1 = 


r»0((i + 1)5) 2 • (l±T(i + l)). 



The following fact will be used in several places in our proofs, either explicitly or not, and its 
proof is given in Appendix [Al 

Fact 4.2. For allO<i<I, 

(i) 1 > <t>(i$) = ft(n- L5£ ); $(t<5) < Inn; i > 1 = 0(n" £ ). 

(ii) 7(1) = o(l); 7(2) = 0(n- 5e ); n~ 30e < T(i) < n- 10 + 



4.1 Proof of Lemma 14.11 

Fix < i < I and assume that the precondition in Lemma 14.11 holds. We prove that each of the 
consequences in Lemma 14.11 hold with probability 1 — n _w( - 1 \ Along the way we state a useful 
lemma that, together with Lemma 14.11 will be used to prove Theorem 13.11 in the next section. 
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For any g £ K n , assuming |Ao(<?, i)| < in 5e , we trivially have that |Ao(<7, i + 1)| < m 5e + Aq(#), 
where \o{g) is the number of sets {gi} £ Ai(g,i) for which it holds that g\ £ -Bj+i, plus the 
number of sets {51,52} £ ^-2(9, *) for which it holds that 51,32 S -Bj+i- Given the precondition 
in Lemma 14. 14 the fact that ^2(3, i)| < n, the definition of i?i+i and the fact that i < J, it is 
clear that E[Ao(<?)] = o(n 5e ). Hence, by Chernoff's bound we get that with probability 1 — n^^^, 
Ao(<?) < n5e - This implies that with probability 1 — n~^^ l \ for all g £ K n , |Ao(g,i + l)| < (i + l)n 5e . 

Next, for any g £ K n , assuming \A\{g, i)\ < i^/n, we trivially have that \Ai(g, i + 1)| < 
iy/n + \\{g) , where \\{g) is the number of sets {31,32} €E ^-2(9, i) fo r which it holds that 31 £ -Bj+i 
and 32 ^ Bi + i. By the fact that ^2(3, i)\ < n and by the definition of -Bj+i, it is clear that 
E[Ai(3)] = o{^/n). Hence, by Chernoff's bound we get that with probability 1— n~ ui ^\ Ai(3) < ^/n. 
This implies that with probability 1 — n~^^\ for all 3 £ K n , |Ai(3,i + 1)| < (i + l)y/n. 

Remark 4.3: The only reason we are interested in maintaining the above upper bound on the 
cardinality of A±(g,i) for all 3 £ K n and i, is that we need this upper bound in order to maintain 
an upper bound on the cardinality of Ao(3,i) for all 3 £ K n and i (as we did above). We will not 
make any further use of the above upper bound on Ai(3,i). 

Having dealt with the easy cases first, we now turn to deal with the two last, more involved 
consequences in the lemma. 



4.1.1 Definitions and an observation 

Definition 2 (Redefinition of Pi+i). Define M := n 20000£ . £ e i B* +1 be a random set of edges, 
formed by choosing every edge in K n \B<i with probability Mn^ 1 ! 2 . For each 3 £ B* +1 , let (3i + \(g) 
be distributed uniformly at random in [0, Mn^ 1 / 2 ] and for each 3 £ K n \ (i?<j U B* +1 ), let (3i + i(g) 
be distributed uniformly at random in {Mn^ 1 / 2 , 1]. 

Clearly, the above definition of is equivalent to the original definition of A+i, given at 
Section [3J Note that the definition of B.- L+ i is not changed and that -Bj+i C B* +1 . 

Let A* (3, i) be the family of all G £ Aj(g, i) such that G C B* +1 . Let A£*(3, i) be the family of 
all G £ A 2 (g, i) such that \G n B* +1 \ = 1. 

Definition 3. Let 3 G K n \ B<i, I £ N. We define inductively a labeled rooted tree T* ; of height 
21. The nodes at even distance from the root will be labeled with edges from K n \ B<i. The nodes 
at odd distance from the root will be labeled with sets of j £ {1,2} edges from K n \ B<i. 

— The root vq ofT*^ is labeled with the edge 3. 

— For every G £ A^(3,z) U A^g^i) do: set a new node u\, labeled G, as a child of vq; 
furthermore, for each edge gi £ G set a new node v\, labeled 31, as a child of u\. 

• I > 2; We construct the tree T* ; by adding new nodes to T* l _ 1 as follows. Let 

(vq, u\, V\, . . . , itj_i, vi—i) be a directed path in T* l _ l from the root vo to a leaf vi-\. Let 
gj be the label ofvj. 
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— For every G G A^ffi— /or which g\^i ^ G do; se£ a new node u\, labeled G, as a child 
ofvi_i; furthermore, for each edge gi G G set a new node vi, labeled g\, as a child of u\. 

— For every G G A*(<7j_i, i) for which gi_ 2 $ G and GU {gi-i, gi-2} isn't a triangle do: set 
a new node ui, labeled G, as a child of ' v\-\; furthermore, for the edge g\ G G set a new 
node vi, labeled gi, as a child of u\. 

Lastly, for GcK n \B< i> define T*j := {T*, : g G G}. 

Consider the tree T* ; . Let v be a node at even distance from the root of T* t . Let /q be 
the label of v. We define the event that v survives as follows. If v is a leaf then v survives by 
definition. Otherwise, v survives if and only if for every child u, labeled G, of v, the following 
holds: if /?j + i(/) < min{/3j +1 (/ ), 5n~ 1 / 2 } for all / G G then u has a child that does not survive. 
For g ^ B<i, let A 9t i be the event that the root of T* z survives. Let Ag,1 '■= OgzG-AgJ- Given 
Definition [31 the following is an easy observation. 

Proposition 4.4. Let I > 1 be an odd integer. 

• Conditioned on {g G 5 i+1 ,TFj U {g} is triangle-free} , 

Ag,i => {g g TF m } => Ag,l+1. 

• Conditioned on {g ^ S<j+i,TFj U {g} is triangle-free}, 

Ag.i => {TF i+ i U {g} is triangle-free} =>■ A g ,i+x. 

4.1.2 Proof of Lemma 14.11 

Let £ * be the event that the following properties hold: 
PI For every g ^ -B<j, 

|AJG/, i)| = 2M$(iS)</>(iS) ■ (1 ± (m + o(T(i) 7 (i)))), 
\A* 2 (g,i)\ = M 2 ^) 2 -(l±(r« + o(r(i) 7 (,)))), 
\Af(g, i)| = 2Myfaf>(iS) 2 • (1 ± (r(0 + o(T(0 7 (0))). 

P2 For every three distinct vertices uu, x, y, if {w, x}, {x, y} ^ B<i\ 

— The number of vertices z such that {w,z},{y,z} G TFj and {x,z} G is at most 
(Inn) 2 . 

— The number of vertices z such that {w,z} G TFj and {x, z}, {y, z} G is at most 
(Inn) 2 . 

— The number of vertices z such that {uu, z} G TFj, {x,z} ^ B<i and {y,z} G is at 
most M 2 . 

P3 For every two distinct vertices x,y, the number of vertices z such that {x,z},{y, z} G -B* +1 
is at most 2M 2 . 
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P4 For every vertex x, the number of edges {x, y} £ B* + x is at most 2My/n. 

Fix once and for the rest of the paper L £ {40,41}. The following is our second technical 
lemma, which is proved in Sections [6H3 

Lemma 4.5. 

• Pr[£*\ = l-n -w W. 

• Let F C -fi'n, \ B<i be a triangle-free graph of size O(l) such that TFj U F is triangle- free. 
Assume B* +1 was chosen and condition on the event that£* holds. Also condition on the event 
that a\ edges of F are in Bi + \ and that a<i edges of F are not in -Bj+i (so that \F\ = a\ + a 2 ). 
Then 

+ - <5>{i5)\ ai ( <p{{i + 1)5)^ a2 



Corollary 4.6. Suppose the settings and assumptions in the second item in Lemma \4-5\ hold. 
Further assume that g £ B<i U F and condition on {g ^ Bi + {\. Then 



Proof. Suppose the settings and assumptions in the second item in Lemma 14.51 hold and let g ^ 
-B<jUi ? . Without conditioning on {g ^ £?j + i}, the corollary follows trivially from Lemma [4.51 only 
with the constant 4.01 being replaced by 4. Now note that we have Pr[g ^ -Bi+i] > 1 — (5M _1 > 
1 — o(r(i)7(i)), where the second inequality is by Fact 14.21 This gives the corollary, since given 
£*, Pt[Af,l] = 1 — (Indeed, Af,l is implied by the event that for all / £ F and for all 

G £ A*(f,i), j £ {1,2}, there is an edge g £ G which is not in Bi + \. Given £* this occurs with 
probability 1 — o(l).) ■ 

For the rest of the section we assume that we have already made the random choices that 
determine the set B* +1 . We also assume that £* holds and keep in mind the fact that this event 
holds with probability 1 — n - "^ 1 ) . We further fix for the rest of the section an edge g ^ B<i and 
condition on the event {g ^ -Bj+i}. We estimate the cardinalities of Aj(g,i + 1) for j £ {1,2}. 

We define random variables that will be used to estimate the cardinality of Aj(g,i + 1) for 
j £ {1,2}. Let Xi(g,l) be the number of sets {gi} £ A\(g,i) for which it holds that gi ^ -E>i+i and 
A git i occurs, plus the number of sets {gi,g 2 } £ A^^ijuAf^, i) for which it holds that g\ £ -Bi+i, 
g2 £ Bi + i, and A gi) i Pi A 92t i occurs. Let \2(g, I) be the number of sets {51, 52} £ A 2 (g, i) for which 
it holds that 51,52 ^ -Bi+i and A gi j D A g2: i occurs. 

By definition of Aj(g, i + 1) and by Proposition 14.41 we have for odd / > 1, 

Ai(<M) < |Ai( 5 ,i + l)| < \ x {g,l + l), 
A 2 (<M) < |A 2 (s,t + l)| < X 2 (g,l + 1). 

Note that since g £ B<i, we have for all F £ A±(g, i) U A 2 (g, i) that TFj U F is triangle- free. Using 
this fact, we can use Corollary 14.61 together with the precondition in the lemma, Fact 14.21 and the 
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fact that Pr[/ ^ B i+1 ) > 1 - 8M~ l to verify that 

E[Ai( 5 ,L)] = 2VH*((i + l)^((* + l)<y)-(l±(r(i) + 9r(i)7(i))) ! 
E[A 2 ( ff ,L)] = n^((i + l)5) 2 .(l±(r(i) + 9r(i)7(*)))- 

We complete the proof by giving concentration results for \j(g, L), j £ {1, 2}. The required bound 
on the cardinality of Aj(g, i + for all g ^ -B<«+i will then follow from these concentration results, 
together with a union bound argument. 

Concentration of \±(g, L): Let Si be the set of edges which is the union of the sets in A±(g, i), 
A^{g, i) and A^^g, i). Let S 2 be the set of all nodes in the trees / £ Si, where Tj ^ is defined 
to be the tree that is obtained as follows: cut off from L every subtree that is rooted at a node 
having a child that is labeled g. Let 53 D Si be the set of edges that are labels of nodes in S 2 . By 
the precondition in Lemma 14.11 and £*, we have that |Si| < M 2 n 1 / 2 and that every tree Tj ^ has 
at most 0(M 2L ) < n 1 / 1000 nodes. Therefore, \S 3 \ < \S 2 \ < M 2 n 1/2+1/1000 . Observe that since we 
condition on {g ^ -Bj+i}, we have that for f £ Si, Af t L depends only on the birthtimes of the edges 
that are labels in Tf 1,. Hence, since Si C S3 we have that \\{g,L) is determined by the birthtimes 
of the edges in S3. We argue below that every edge in S3 appears as a label in at most n 1 / 1000 trees 
Tf t L, f £ Si. This implies that changing the birthtime of a single edge in S3 can change Xi(g, L) 
by at most n 1 / 1000 . It will then follow from Azuma's inequality, the bound above on the number 
of edges in S3, the bound on E[Ai(g, L)\ and Fact 14.21 that, as needed, with probability 1 — n~ u ( l \ 

X 1 (g, L) = 2^$((* + l)S)cf>{(i + 1)5) • (1 ± F(i + 1)). 

We argue that every edge in S3 appears as a label in at most n 1 / 1000 trees f £ Si. For 

g',g" 6 S3, say that g' affects (resp. directly- affects) g" if there is a tree Tf t L, f £ Si, with a path 
(resp. path of length or 2) leading from a node labeled g" to a node labeled g' . It is enough to 
show that every edge in S3 affects at most j^V 1000 edges in Si. 

Fix g' £ S3. By Definition [3j if g" £ S3 \ Si then g" £ B* +1 . Therefore, the number of edges 
g" £ S3 \ Si that g' directly-affects is at most \A*(g',i)\ + ^{g' ,i)\ + 1 = 0(M 2 ), where the upper 
bound is by £*. If g' shares no vertex with g then it is clear that g' directly-affects at most 5 
edges in Si . If g' shares exactly 1 vertex with g then one can verify that given the precondition in 
Lemma 14. II and £* (specifically by PI, P2 and P3), g' directly-affects at most 0(M 2 ) edges in Si. 
This covers all possible cases since g' 7^ g by definition of S3. We conclude that every edge in S3 
directly-affects 0(M 2 ) edges in S3. Since a path in Tf^ has length at most 2L, and the edges that 
are labels along such a path are all in S3, we get that every edge in S3 affects 0(M 2L ) < n 1 / 1000 
other edges in S3. Since Si C S3 we are done. 

Remark 4.7: In the argument above, it was essential that we condition on {g ^ Bi + \}. Had 
we not done that, it would be the case that changing the birthtime of g would change X\(g,L) 
potentially by at least |Ai(g, This affect is too large, as it will render Azuma's inequality 
useless in providing us with the concentration result we seek. 

Concentration of \2{g, L): Let Si be the set of edges which is the union of the sets in A 2 (g, i). 
Let S2 be the set of all nodes in the trees Ti L , f £ S\. Let S3 D Si be the set of edges that are 
labels of nodes in S 2 . Trivially, |Si| < 2n. Also, by £* every tree T~f L has at most 0(M 2L ) < n 1/wo ° 
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nodes. Therefore, IS3I < IS2I < 2re 1+1 / 1000 . Observe that \2(g,L) is determined by the birthtimes 
of the edges in S3. We argue below that there is a set of at most M 2 n 1//2+1//1000 edges in 53, each 
of which is a label in at most M 2 n 1 / 2+1 / 1000 trees TJ L , f G Si, and that every other edge in S3 is a 
label in at most n 1//100 ° trees TJ L , f G Si. It will then follow from Azuma's inequality, the bound 
above on the number of edges in S3, the bound on E[A2(<7, B)] an d Fact 14.21 that as needed, with 
probability 1 — n - ^ 1 ), 

X 2 (g,L) =n«/»((i + l)«5) 2 -(l±r(i + l)). 

Define affects and directly- affects exactly as above. It is enough to show that there is a set of at 
most M 2 n 1 / 2+1 / 1000 edges in S3, each of which affects at most M 2 n 1 / 2+1 / 1000 edges in Si, and that 
every other edge in S3 affects at most n 1 / 1000 edges in Si. 

For a fixed edge g' G S3, we collect a few useful observations. First assume that g' ^ B* +l . 
Then by Definition [3l we must have that g' G Si and that g' appears as a label only at the root 
of T*, L . Therefore, if g' £ B* +1 then g' affects (and directly-affects) only g' . Next assume that 
g' G B* +1 . By £* we have that g' directly-affects at most \A\(g',i)\ + lAjj (</,«) I + 1 = 0(M 2 ) edges 
g" G SsnB* +1 D Ss\Si. If g' shares no vertex with g then g' clearly directly-affects at most 5 edges 
in Si. If g' shares at least one vertex with g then it follows from the precondition in Lemma 14.11 
and £* (specifically by P4) that g' directly-affects at most M 2 n l l 2 edges in Si. Lastly we note that 
for every f E Si the following holds: every edge that is a label in Tt L , except perhaps for /, is in 

•v •/>';.:• 

Say that g' is a bad-edge if g' affects an edge g" £ S3 Pi B* +1 that shares at least one vertex 
with g. From the observations in the previous paragraph, it follows that if g' is a bad-edge then 
g' affects at most M 2 n 1/2 • 0(M 2L ) < M 2 n 1 ' ' 2+1 ' ' 100 ° edges in Si; on the other hand, if g' is not 
a bad-edge then g' affects at most 0(M 2L ) < n 1//100 ° edges in Si. It thus remains to bound the 
number of bad-edges. By £* there are at most M 2 n l l 2 edges g" G B* +l that share at least one 
vertex with g. In addition, by £*, for every edge in S3 there are at most 0{M 2L ) < n 1 / 1000 other 
edges that affect it. Hence, there are at most M 2 n 1 / 2+1 / 1000 bad-edges. With that we are done. 



5 Proof of Theorem 13.11 

Let F C K n be a triangle-free graph of size O(l). Say that the triangle-free process well-behaves if 
for every < i < I, the precondition in Lemma 14 . 1 1 holds . Note that for i = the precondition in 
Lemma 14.11 holds trivially. Hence, by Lemma 14.11 and the union bound, the process well-behaves 
with probability 1 — n -1 ^ 1 ). 

For < i < I, define 

5 

In this section we will use a to denote a placement {/ G Bi f+ i : / G F}, where < if < I for all 
/ G F. We will show that for every fixed placement a, 

Py[F C TF/, process well-behaves | a] ~ (1) 
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Note that for every placement a, Pr[a] ~ (4^Y F . Taking J2 a to be the 

sum over all possible 

placements a, it will then follow from ([1]) that 

Pr[F C TF/, process well-behaves] = ^Pr[a]Pr[FCTF/ , process well-behaves | a] 

a 

ja) En*) 

v 7 a feF 



n 



where the validity of the last equality is by Claim [5TT1 below. Since the process well-behaves with 
probability 1 — n -1 ^ 1 ) and &(I5) — » oo as n — > oo, it will then follow that, as needed, 

Pr[F C TFj] = + Pr[F C TF/, process well-behaves] ' 

Claim 5.1. £ Q EI/eF ¥>(*'/) = H^Y 



n 



Proof. Let {Zf : f £ F} be a set of mutually independent 0/1 random variables, defined as follows. 
For every / G F, choose uniformly at random an index < if < I and let / S Bi f+ \. Then, let 
Zf = 1 with probability (f{if). (We note that ip(i) £ [0,1] for all < i < I; see Remark 17.31 ) In 
this context, the probability of a placement a is I~ ep . By definition we have 

Pr[V/ € F. Zf = 1] = ]TPr[a]Pr[V/ eFZ f = l\a}= r e ? £ J] pfy). 

On the other hand, by independence and symmetry we have for every fixed g £ F, 
Pr[V/eF. Z f = l] 1 ^ =Pv[Z g = l] =T l V(i 9 ) = (I5)- l <S>(I5). 



It remains to prove ([I]). Fix a placement a. For < % < J, define Fj := Ffl -B<j. For every 
< i < J, define the events: 

Qi(i). The precondition in Lemma 14 . 1 1 holds for i. 

Q 2 (i): TFj U (F \ Fj) is triangle-free. 
Q 3 (0 : Ft C TFj. 

Let Q(i) := Qi(i) n Q 2 (0 n Q 3 («). Note that the event f|o<i</ 2(0 is exactly the event {F C 
TF/, process well-behaves | a}. Therefore, it remains to estimate the probability of C\o<i<i SCO- 
Note that Q(0) holds trivially. The next proposition gives an estimate on the probability that 
Q(i + 1) holds given Q(i). Iterating on that proposition for all < i < I gives ([I]). 



Proposition 5.2. Let < i < I and assume Q(i) holds. Then Q(i + 1) holds with probability 

(l±O(n- 10e )) 



$((i + l)5) -<S>(i5)\ lF * +AF * 1 f0((i + l)5)\ lFXF * +l1 i _. Lll: 
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Proof. Assume that we are given an instance of TFj and that Q(i) holds. Consider the process as 
it creates TFj + i. For the rest of the proof, our context is the one given in Section [4.11 

Since F C K n is triangle-free of size O(l), we have by definition that F \ Fi C K n \ I?<j is 
triangle- free of size 0(1). By Q(i) we have that TFj U (F \ Fi) is triangle- free. Therefore, taking 
a\ = \Fi + \ \ Fi\ and ai = \F\ jPj+i|, it follows from Lemma S3] and Fact 14.21 that 

Let F' be the set of all edges / such that F U {/} contains a triangle and note that \F'\ = O(l). 
Let £' be the event that for every / G F' , {/ ^ TFj + i}. We have that £' is implied by the event 
that for every / G F' , {/ $ B i+1 } occurs. Therefore, Pr[f] > 1 - O^n" 1 / 2 ) > l-n~ We . By 
Lemma HTT] we have Pr[Qi(i + 1)] = 1 — n - ^ 1 ). Thus, since Pr^^^^] = 1 — o(l) (see the proof 
of Corollary 14.61) . it follows that 



All that is remained to observe is that the probability of {Afxf^Lj £', Qi(i + 1)} above is an 
estimation of the probability of Q(i + 1). Indeed, it follows from Proposition 14.41 that if L is odd 
(resp. even) then {Ap\Fi,L> £' ■> Qi(,i + 1)} implies (resp. is implied by) Q(i + 1). ■ 



6 Proof of Lemma 14.51 



Here we prove Lemma [4.51 modulo one lemma whose proof is given in the next section. Our context 
in this section and for the rest of the paper is the one given in Section 14.11 where the lemma was 
stated. That is, we fix < i < I and assume the precondition in Lemma 14 . 1 1 holds . 

The first item in Lemma [4.51 follows from Chernoff's bound, using Fact 14.21 and the precondition 
in Lemma 14.11 Thus it remains to prove the second item in the lemma. For the rest of the paper 
we assume that F C K n \ B<i is a triangle- free graph of size O(l) and that the preconditions in the 
second item of Lemma 14.51 hold. That is, we assume that TFj U F is triangle- free, B*. 1 was chosen 
and £* holds. We also condition on the event that a\ edges of F are in -Bj+i and that 02 edges of 
F are not in Bi + \. We remark that while we do have the set Bf +1 at hand, we have not yet chosen 
the random function $+1. 

The basic idea of the proof is as follows. We need to analyse the event Af,l, and the definition 
of this event calls for a recursive analysis. However, the fact that there could possibly be edges 
that are labels in more than one node in Tp L makes such a recursive analysis difficult. As we 
insist on analysing Af,l recursively, the following observation comes to the rescue. Define m := 
[n 100e J^>(i<5) _1 . Redefine the birthtime function fli+i as follows. Let B* +1 be a random set of edges 
formed by choosing every edge in B* +t with probability mM" 1 ; for each g S B* +1 , let j3i + \{g) be 
distributed uniformly at random in [0, mn" 1 ' 2 ]; for each g E B* +1 \B* +1 , let 0i + i(g) be distributed 
uniformly at random in (mn _1//2 , Mra -1 / 2 ] and for all other edges g £ i?<j, let /3i+i(g) be distributed 
uniformly at random in (Mn~ 1//2 , 1]. Let A*(g,i) be the set of all G G A*(g,i) such that G C B* +l 
and note that -Bj+i C B* +1 C B* +1 . Let Tt L be defined exactly as Tt L only that now we use in 
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the definition A*(g,i) instead of A*(g,i). Define T FL := {T jL : / <E F}. It turns out that with 
a sufficiently high probability, every edge that is a label in T FL is a label of exactly one node in 
Tp L . Moreover, in order to analyse the event Af,l, one only needs to consider the birthtimes of 
the edges that are labels in T FL . This will allow us to analyse Af,l recursively. 

Let £* be the event that for every g £ B<i, 

\A$(g,i)\ = 2rn$(i6)(f)(i5) ■ (1 ± 1.0ir(t)), 
\A* 2 (g,i)\ = m 2 ct>(i8f ■ (1 ± l.Oir(i)). 

Let £ F be the following event: if g is a label of some node at even distance from the root of a tree 
in T FL , then g is the label of no other node at even distance from the root of a tree in T FL . 

The following two lemmas correspond to the basic idea outlined above, and clearly imply 
Lemma 14.51 

Lemma 6.1. Pr[£*,£ F ] = 1 - 0{M~ 1 / 1Q ) > 1 - o(T(i)-f(i)). 
Lemma 6.2. Assume B* +1 was chosen and condition on £* f]£ F . Then 

+ 1)5) - <5>{i5)\ ai (<j>{{i + 1)5)^ a2 



The proof of Lemma 16. II is given below. The proof of Lemma 16.21 is given in the next section. 
6.1 Proof of Lemma 16.11 

By Chernoff's bound we have Pr[£*] = 1 — n~ u ^ l \ Therefore, it is enough to prove that Pr[<?p] > 
1 - M" 1 / 10 . We assume that F is not empty, otherwise the assertion is trivial. For brevity, 
set A*(g,i) := A\{g,i) U A^{g, i) for all g £ K n . When stating that two graphs share a edges (or 
vertices), unless otherwise stated this means that the two graphs share exactly a edges (or vertices). 

Definition 4 (bad-sequence). Let S = (G\, G^, . . . , G{) be a sequence of subgraphs of K n with 
1 < I < 2L. We say that S is a bad-sequence if the following properties hold simultaneously. 

• For every j £ [I]: Gj £ A*(g, i) for some g £ F U Ufe<j Gk- 

• For every j £ [I — 1]; Gj shares \Gj\ vertices and edges with F U \J k< j Gk- 

• Either 

— Gi shares \Gi\ + 1 vertices and at most \Gi\ — 1 edges with F U Ufc</ (*k> or 

— Gi shares \Gi\ vertices and edges with F L) [J k<l Gk- In addition, there is an edge 
{x,y} £ FU\J k<l Gk such that G[ £ A*({x,y},i) and there is an edge in G\, without loss 
of generality {x, z\, with the following property: there is an edge {w, x} £ F U Ufc<; &k 
with w ^ y such that {in, z} £ TFj. 

Let £ be the event that for every bad-sequence S = (Gi, £?2, . . . , G{) there exists j £ [I] such 
that {Gj B* +1 }. The next two propositions imply the desired bound Pr[£J,] > 1 — M -1 / 10 , as 
they state that £ implies £ F and Pi[£] > 1 - M" 1 / 10 . 
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Proposition 6.3. £ implies £* F . 



Proof. Assume £ occurs. We have the following claim. 

Claim 6.4. Let P = (vq, iti, • • • , vl) denote an arbitrary path in T FL , starting with some 
root vq and ending with some leaf. Let Gj be the label of node Uj and let gj be the label of node Vj 
(so that go £ F). Then for every j £ [L\: Gj shares edges with F U \J k< j G k . 

Proof. Suppose for the sake of contradiction that the claim is false, and fix the minimal I £ [L] for 
which G\ shares at least one edge with F U Ufc</ ^k- Consider the sequence S = (Gi, G2, ■ ■ ■ , G\). 
We shall reach a contradiction by showing that S or some prefix of S is a bad-sequence. 

A key observation is this: for all j £ [I — 1], Gj shares \Gj \ vertices and edges with FU|J fc< • G k . 
To see that the observation holds, first note that the minimality of I implies that for all j £ [I — 1], 
Gj shares < \Gj \ — 1 edges with F U \J k< j Gk- In addition, trivially, for all j £ [I — 1], Gj shares 
at least \Gj\ vertices with F U Ufc<j ^fc- These facts together with £ now imply that there cannot 
be j £ [I — 1] such that Gj shares \Gj\ + 1 vertices with F U \J k< j G k . 

Suppose that \Gi\ = 2. By assumption we have that Gi shares at least one edge with FL)\J k<l Gk, 
which also implies that Gi shares + 1 vertices with FU{J k<l G k . Hence, by the key observation 
above, in order to show that S is a bad-sequence and reach a contradiction, it remains to show that 
Gi shares 1 = \Gi \ — 1 edge with FL)[j k<l G k . Suppose on the contrary that Gi shares both of its 2 
edges with F U Ufc<z @k- Notice that since F is triangle-free, this implies that I > 2, so gi_2 is well 
defined. Write <?;_2 = {x,y} and gi_i = {x,z} and note that z ^ {x,y}, Gi^i £ A*(gi_ 2 ,i) and 
Gi £ A*(gi_i,i). Now, note that the edge in G\ that is adjacent to z must also be an edge in Gi-\. 
This is true since otherwise, G;_i will share the vertex z with FU Ufc<z-i ^fc> which is clearly not 
the case as by the key observation above shares only vertices from {x,y} with FL)\J k<l _ 1 G k . 
The only possible edge to be adjacent in Gi to z and be in is the edge {y, z}. Hence we get 
that y is a vertex of Gi. Therefore, we conclude that gi~2 G Gi- But by the definition of Tp L , 
9i-2 $ G\. Thus, Gi shares 1 edge with F U Ufc<z G k as needed. 

Next assume that = 1. By assumption we have that Gi shares its edge with F U IJfc<z G k . 
Since TFj U F is triangle-free, this implies that / > 2 and so gi_2 is wen defined. Let x, y, z be as 
defined in the previous paragraph. Note that either x or z are vertices of Gi. First we claim that z 
cannot be a vertex of G\. Indeed, if z was a vertex of G[ then by a similar argument to that in the 
previous paragraph we get that Gi must be the edge {y, z}. But this implies that {gi-2, 9i-i, 9i} 
is a triangle and thus contradicts the definition of T FL . Therefore, j; is a vertex of Gi. We next 
argue that (Gj) 1 ^ is a bad-sequence, and by that get a contradiction. Note that {x, y} is an edge 
in F U\J k<l _ 1 G k such that £ A*({x,y},i) and that {x,z} is an edge in Gz_i. Let {w,x} be 
the edge in G[. By definition of Tp L we have that w ^ {x,y,z}. This implies, since we assume 
that {w, x} is an edge in F U IJfc<z G k , that {w, x} is an edge in F U Ufc</-i Cfc- Since \G[\ = 1 we 
have that {w,z} £ TFj. With the key observation above it now follows by definition that {Gjf^}^ 
is a bad-sequence. ■ 

The next claim, when combined with Claim [6T4"l implies the proposition. 
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Claim 6.5. Fix 1 < I < L and let u be a node at distance 21 — 1 from a root in T FL . Fix 1 < I' < I 
and let v! be a different node at distance 21' — 1 from a root in T F L . Then the labels of u and u' 
share edges. 

Proof. The proof is by induction on I. For the base case I = 1, let u and u' be two distinct nodes 
at distance 1 from the roots of T FL . Let G and G' be the labels of u and u' respectively. Assume 
for the sake of contradiction that G and G' share at least one edge. We claim that either (G) or 
(G,G') is a bad-sequence thus reaching the desired contradiction. To see that this indeed holds, 
note first that by Claim [674l G shares |G| vertices and edges with F and G' shares \G'\ vertices 
and edges with F. Let v and v' be the parents of u and u' respectively. Since G and G' share at 
least one edge and u ^ u' , we have that v ^ v' . Therefore G and G' share exactly 1 edge. Now, if 
\G'\ = 2 it follows that G' shares \G'\ + 1 vertices and \G'\ — 1 edges with FUG; this implies that 
(G,G') is a bad-sequence. Next, assume \G'\ = 1. Let {x,y} G F and {w,x] S F be the labels of 
v and v' respectively. Let z be the vertex of G and G' that is not in F so that G and G' share the 
edge {x,z}. Clearly w ^ y. In addition, since \G'\ = 1 we have that {w,z} £ TFj. It follows that 
(G) is a bad-sequence. 

Fix 2 < I < L and assume the claim is valid for I — 1. Let u be a node at distance 21 — 1 from 
a root in T FL . Fix 1 < l' < I and let v! be a different node at distance 21' — 1 from a root in Tp L . 
Assume for the sake of contradiction that the label of u shares at least one edge with the label of 
v! . Without loss of generality we further assume that I' is minimal in the following sense: the label 
of u shares edges with the label of every node at odd distance less than 21' — 1 from the root of 
T FL . By Claim [631 we may also assume that u' is not a node on the path from a root to u in T FL . 

Let P be the unique path from a root to u in T FL . Let P' be the longest unique path in 
T FL that ends with u' and which do not contain a node from P. Traverse the nodes along the 
path P and then traverse the nodes along the path P 1 , ending each traversal at the nodes u and 
v! respectively. Let (u\,U2, ■ ■ ■ ,u s ) be the nodes so traversed that are at odd distances from the 
roots of the forest, in order of their traversal. By construction, ui = u and u s = u' . We note 
that 2 < s < 2L. Let Gj be the label of node Uj and set S± = (G%, G2, ■ ■ ■ , G s ). Let S2 = 
{G\, G2, ■ ■ ■ , Crj-i, G1+1, G; + 2, • • • , G s _i, Gi). In words, S2 is obtained from Si by first removing Gi 
and G s and then concatenating G[ to the end of the new sequence. Note that Gi and G s are the 
labels of u and u' respectively and that by assumption G\ and G s share at least one edge. We show 
below that either S\ or S2 is a bad-sequence and by that reach the desired contradiction. 

Assume that \G S \ =2. We show that Si is a bad-sequence. By Claim WM the minimality of /' 
and the induction hypothesis we have that for every j G [s— 1], Gj shares edges with -F 1 U(J fc<J - G\~. 
Therefore, by £ we also have that for every j £ [s — 1], Gj shares \Gj\ vertices with F U \J k< j G\~. 
Let V[ be the parent of u\ and v s the parent of u s . Let g\ be the label of vi and g s the label of v s . 
Since ui 7^ u s and yet Gi and G s share at least one edge, we get that v\ ^ v s . This implies by 
Claim HT4l and the induction hypothesis that gi 7^ g s . This, in turn, implies that Gi shares exactly 
1 edge with G s . In what follows we show that G s shares edges with F U Ufc<z i<k<s This 
will give us that G s shares |G S | + 1 vertices and 1 = |G S | — 1 edges with F U Ufc< s ^k, which given 
the above implies that S± is a bad-sequence. The fact that G s shares edges with F U U/<fc<s 
follows from Claim HP1 We claim that G s shares edges with \J k< i G k - Indeed, if G s does share 
at least one edge with IJfc<z ^ki then since G s also shares at least one edge with G7, we get that 



16 



Gi shares \Gi\ + 1 vertices with F U Ufc<z Cfc- But since Z € [s — 1], we've ruled out that possibility 
above. 

Assume that \G S \ = 1. We show that is a bad-sequence. For brevity, rewrite = 
(i*i,i<2, . . . ,F s -i) and note that F s _\ is the label of ui. By Claim 16.41 the minimality of I' and 
the induction hypothesis we have that for every j £ [s — 1], Fj shares edges with i 7 U \J k< j F^. 
Therefore, by £ we also have that for every j £ [s — 1], -Fj shares vertices with F U IJfc<j -^fc- 
Define (7/ , g s as in the previous paragraph and note that for the same reasons as above we have 
that gi / g s . Also note that gi,g s £ F U Ufc< s -i^fc- Write = and let 2 be the vertex of 

Gi that is not in {x,y}. Assume without loss of generality that Gi and G s share the edge {x,z}. 
Since z is not a vertex of F U Ufc< s -i> we that x is a vertex in g s . Write g s = {w, x} and note 
that w / y. Lastly, since \G S \ = 1 we have that {w,z} £ TFj. Therefore, by definition, S2 is a 
bad-sequence. ■ 

With that we complete the proof of the proposition. ■ 
Proposition 6.6. Pr[£] > 1 - M -1 / 10 . 

Proof. For a bad-sequence S = (G\, G2, ■ ■ ■ , Gi), write {S 1 C B* +1 } for the event that for all j £ [I], 
{Gj C -B*, 1 }. Let Z be the random variable that counts the number of bad-sequences S for which 
{S C It suffices to show that E[Z] < M" 1 / 10 . 

For I £ [2L], < c < Z, let Seq 1 (/, c) denote the set of all bad-sequences S = (G\, G2, ■ ■ ■ , G{) 
with c = \{j : \Gj\ = l,j < l}\ such that Gi shares \Gi\ + 1 vertices and at most — 1 edges 
with FU Ufc</Cfc- For I £ [2L], < c < Z, let Seq 2 (/,c) denote the set of all bad-sequences 
S = (Gi, G2, ■ ■ ■ , Gi) with c = \{j : \Gj\ = 1, j < l}\ that are not in Seq 1 (Z, c). Then 

E ^ = E E E E Pr lscB,y. (2) 

ie[2L] 0<c<Z je{l,2} ffeSeqj- (i,c) 

Below we show that 

VZ £ [2L], < c < Z. Pv [ S ^ B i+i) ^ M~ 1/9 , (3) 

5eSe qi («,c) 

VZ £ [2L], 0< c< Z. Pt [ S ^ b *+i\ < M- 1 / 9 . (4) 

5eSeq 2 («,c) 

From (|2]), (JSJ) and (gj) and since L = O(l), we get that E[Z] < M" 1 / 10 as required. 

We prove ([3]). Fix Z £ [2L], < c < Z. We first count the number of sequences S = 
{G\, G2, ■ ■ ■ , Gi) in Seq 1 (Z, c). To do so, we construct such a sequence iteratively. First, we choose 
the cardinalities of the first Z — 1 subgraphs in S. Note that there are = 0(1) possible choices 
for the cardinalities. Suppose we have already chosen the first j — 1 subgraphs in S for some j < I. 
Given that, we count the number of choices for Gj assuming j > 1. There are O(l) possible choices 
for an edge g £ F U Ufc<j for which Gj £ A*(g,i). Given g: if \Gj\ is to be of size 1 then there 
are at most A\(g,i) choices for Gj and if \Gj\ is to be of size 2 then there are at most A^^, i) 
choices for Gj. Given that we have already chosen the first Z — 1 subgraphs in S, the number of 
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choices for Gi is at most 0(1), since the vertices of Gi are all in FU Ufc<z (*k- Therefore, by £* the 
number of sequences in Seq 1 (Z, c) is at most 

O(l) • (M^iiS) 2 ) 1 " 1 '" ■ (M<S>(i6)(i>(i5)) c . 

Even if we condition on the event that a± edges of F are in Bi + \ and a 2 edges of F are not in -Bj+i, 
we get that the probability of {5 C B* +1 } for S E Seq 1 (Z, c) is at most 

2 \ l—l—c / \ c 



Hence, 



£ PrpCB^J < 0(l)-(m 2 ^) 2 )^ 1_C -(m$(^)^)) C -^ 



5eSe qi (i,c) 



< 0{l) ■ m 2l - 2 - 2c ■ {m\nn) c ■ ^ 



< M- X l\ 

where the second inequality follows from Fact 14.21 and the last inequality follows from the definition 
of L,m and M. This gives us the validity of ([3|). 

It remains to prove ([!]). Fix I E \2L], < c < I. As before, we first count the number of sequences 
S = (G\,G2, ... ,Gi) in Seq 2 (Z,c) and we do it by constructing such a sequence iteratively. The 
number of choices for the first I — 1 subgraphs in S is exactly as in the previous case. Suppose we 
have already chosen the first I — 1 subgraphs in S. We claim that the number of choices for Gi 
is at most 0((lnn) 2 ). Indeed, there are 0(1) choices for an edge {x,y} £ FU Ufc<; ^fc such that 
Gi E A* ({x , y} , i) . Given {x,y}, there are at most O(l) choices for an edge {w,x} E F U \J k< i G^ 
such that w / y. Furthermore, given {x, y} and {w, x}, by £* (specifically by P2) there are at most 
2(lnn) 2 choices for Gi E A*({x, y},i) which has a vertex z that is not a vertex of F U Ufc</ 0& and 
such that {x,z} E Gi and {w,z} E TFj. Therefore, by £* the number of sequences in Seq 2 (/,c) is 
at most 

0(1) • (M 2 ^(*<5) 2 ) , " 1-c • (M$(iS)<Ki8)) c ■ (Inn) 2 . 

Even if we condition on the event that a\ edges of F are in B%+\ and a 2 edges of F are not in 

we get that the probability of {S C B* +1 } for S E Seq 2 (/, c) is at most as given in ©. Therefore, 

^ Pr[SCB* +1 ] < 0(l)-(m 2 ^) 2 )'- 1 - c .(md>(i ( 5)^)) c -^.(lnn) 2 

SeScq 2 (/,c) 

< 0(1) • m 2[ - 2 - 2c • (m In n) c • ^ • (In n) 2 

< m- 1 / 9 , 

where as before, the second inequality follows from Fact 14.21 and the last inequality follows from the 
definition of L,m and M. This gives us the validity of ((4]). With that we complete the proof. ■ 
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7 Proof of Lemma 16.21 



Assume that B* +1 was chosen and condition on £* H £p. Fix an edge / S F. Note that we 
either condition on the event that / is in Bi + \ or not. For simplicity of presentation, we do not 
choose right now which of these two options hold. The exact choice will be made implicitly below, 
whenever we condition on an event which is concerned with the birthtime /3i+i(f). 

Some remarks regarding TJ L follow. The event £ F says that every label of some node in TJ L 
is a label of exactly one node in TJ L . Therefore, we shall refer from now on to the nodes of TJ L by 
their labels. The event £* implies, using the definition of TJ L and Fact 14.21 that for every non-leaf 
node g at even distance from the root of TJ L , 

Number of children of g that are of size 1 = 2m<&(i5)(ft(i5)(l db 1.02T(z)), (6) 
Number of children of g that are of size 2 = m 4>(i 5) 2 (i±imr(i)). (7) 

We need to define the following two additional rooted trees. 
Definition 5 (Too,!)). 

• Let Too be an infinite rooted tree, defined as follows. Every node g at even distance from the 
root has two sets of children. One set consists of children which are singletons and the other 
set consists of children which are sets of size 2. Every node G at odd distance from the root 
°f T^, which is a set of size \G\ 6 {1,2}, has exactly \G\ children. Lastly, for every node g 
at even distance from the root: 

Number of children of g that are of size 1 = \2m&(i5)(p(i5)~], 
Number of children of g that are of size 2 = m 2 (f)(i5) 2 . 

• Let < I < L. Define T\ to be the tree that is obtained by cutting from T^ every subtree that 
is rooted at a node whose distance from the root ofT^ is larger than 21. 

Remark 7.1: Note that m 2 <f)(i5) 2 is an integer. It would be convenient to assume from now on 
that 2m&{i5)<j){i8) is also an integer. Hence, for example, the number of children of the root of Tqq 
that are of size 1 is exactly 2m<&(i5)(f)(i5). We explain in Section [7.41 how to modify our proof for 
the case where 2m$>(i5)4>(i5) is not an integer. 

We continue with some more setup. Note that for every node g ^ f at even distance from the 
root of TJ L , f3i+i(g) is distributed uniformly at random in the interval [0, mre -1 / 2 ]. We extend the 
definition of (3i+\ so that in addition, for every node g at even distance from the root of T^ (and 
hence from the root of Tl), the birthtime Pi + i(g) is distributed uniformly at random in the interval 
[0 5 mn~ 1/2 ]. 

Let T G {TJ l ,Tl,T 00 }. Let go be a node at even distance from the root of T. We define the 
event that go survives as follows. If go is a leaf (so that T ^ T^) then </o survives by definition. 
Otherwise, go survives if and only if for every child G of go> the following holds: if f3i + \{g) < 
min{/3j + i(go), o~n~ l l 2 } for all children g of G, then G has a child that does not survive. 
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For a node g at height 21 in Tj L , let p g< i(x) be the probability that g survives under the 
assumption that f3i + \(g) = xn~ x l 2 . Let pi(x) be the probability, at the limit as n — > oo, that the 
root of T[ survives under the assumption that = ot -1 / 2 , where g here denotes the root 

of T\. Let p(x) be the probability, at the limit as n — > oo, that the root of survives under 
the assumption that f3i+i(g) = in" 1 / 2 , where g here denotes the root of Too. One can show that 
p g ,i{x),pi(x) and p(x) are all continuous and bounded in the interval [0,5]. Hence, we can define 
the following functions on the interval [0,6]: 



px px pa 

P g ,i( x ) '■= / Pg,i(y) d V' p i( x ) '■= / Pi{y) d y and p ( x ) : = / 

Jo Jo Jo 

Observe that for all x £ (0,5]: 

Pr[The root / of Tj L survives | j3i + \(f) < xn 
lim Pr[The root g of T[ survives | < xn 

lim Pr[The root g of T M survives | (3i+i(g) < xn 



p{y)dy. 



1/2] 






X 


1/2] 


Pi(x) 




i 

X 


1/2] 


P(x) 




X 



The next lemma, when combined with the discussion above and the definition of Af,l, implies 
Lemma [~~ 



Lemma 7.2. 

(ii) For all x G [0, 5], p L (x) = p(x)(l ± o(T(i)j(i))). 
(Hi) For all x G [0, 5], pf^(x) = pi(x)(l ± 3T(i)j(i)). 

The proof of Lemma 17.21 is given in the next three subsections. 



7.1 Proof of Lemma 17.21 (i) 

Clearly p(0) = 1 and P(0) = 0. Hence, from the definition of survival and the definition of p(x) 
and P(x), we get that for every x G [0,5], at the limit as n — > oo, 

= exp ( - P(xfcp(i5) 2 - 2P{x)${i8)4>(i8)\ . 

By the fundamental theorem of calculus, p(x) is the derivative of P(x). Hence, we view ([8]) as the 
separable differential equation that it is. This equation has the following as an implicit solution: 

J exp (P 2 (p(i5) 2 + 2P<f)(i5)<S>(i5))dP = x. 
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Solving the above integral, we get 



er&($(i5) + <f>(iS)P) = x + C. (9) 
With the initial condition P(0) = 0, we get from ([9]) that 



Let z > satisfy 



^erfi($(i<5)) = C. 



exp(-z 2 <^) 2 - 2zj>{i5)<f>{i8)) - ^ + ' } " ! 



A simple analysis shows that 

_ + 1)8) - <S>{i6) 
Taking P = z and C = ^erfi($(z5)), we solve ([9]) for x to get 



erfi($(i<5) + 0(i<5)P) - C = \- (erfi(*((i + 1)6)) - erfi(*(W))) = 5, 

i 



where the last equality is by the fact that -^-erfi(<I>(x)) = x. Hence, P(6) = ^^V/l-, and 



= ■ This completes the proof. 



Remark 7.3: As a side note, we observe that < P{6) < 5. Hence we get from the above 
conclusion and from Fact that m+M^Ml = p(6)(j){i6)/6 > and that m±Mz*M 



P(6)<j>(i6)/5 < 1. 



7.2 Proof of Lemma 17.21 (ii) 

Assume first that L is odd. Let go be the root of Tl and T^. Further assume /3j + i(<?o) = ii -1 ' 2 
for some x G [0,5]. Clearly if go survives in Tl then go survives in T^. Hence pl{%) < p(x). 
Below we show that pl(%) > p(x) — n _36e . We claim that this last inequality implies pl(x) = 
p(x)(l — o(r(i)7(i))), which gives the lemma. Indeed, using the fact that x < 5 and since trivially 
P(x) < x, it follows from (JSj) , the definition of 6 and Fact l4.2l that p(x) ~ 1. In addition, by Fact 14.21 
we have that r(i)7(i) = f2(n _35e ). Therefore we get, as needed, 

Pl{x) > p(x)(l - n- 36£ /p(x)) = p{x)(l - o(r(*)7(i))). 

Say that a node g at even distance from the root of Tl is relevant, if 5 and its sibling (if exists) 
have a smaller birthtime than their grandparent, and in addition, their grandparent is either relevant 
or the root. Observe that if the root of survives then either the root of Tl survives, or else, 
there is a relevant leaf in Tl- It remains to show that the expected number of relevant leaves in Tl 
is at most n~ 36e . 

Say that a leaf gL in Tl is a c-type if the path leading from the root to gL contains exactly c nodes 
G at odd distance from the root, which are sets of size 1. Consider a path (go, G\, g%, . . . , Gl, 9l) 
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from the root to a leaf gi,, where gi is a c-type. Let Q be the union of {gj : j £ [L]} together with 
the set {g : g is a sibling of some gj, j £ [L]}. Since gi is a c-type, we have \Q\ = 2L — c. Now if 
gi is relevant, then for every node g £ G, {Pi+i(g) < /3j+i(<?o) = xn~ 1 / 2 } holds. This event occurs 
with probability (x /m) 2L ~ c . Hence, the probability that ql is relevant is at most 

X \ / X \ c / X 2 \ L—c 



mJ \mJ \m 2 



The number of c-type leaves in Tl is at most 

2 L {2m<S>(i5)<p{i5)) c (2m 2 cf)(i5) 2 ) L - c < (4mlnn) c (4m 2 ) L ~ c , 

where the inequality is by Fact 14.21 Hence, the expected number of relevant c-type leaves in Tl is 
at most 

f — V (^) L ~ C (4mlnn) c (4m 2 ) L - c < (4xlnn) 2L ~ c . 



Now, (4xlnn) 2L ~ c < 5 2L ~ c (4lnn) 2L ~ c < 5 L ~ l ~ n~ 4 ° £ , where the inequalities are by x < 5, c < L 
and (41nn) 2i < To complete the proof, note that if a leaf is a c-type, then we have at most 
L + l = 0(1) possible choices for c. Therefore, with the union bound we conclude that the expected 
number of relevant leaves in Tl is at most n _36£ . 

Next assume that L is even, let go be as above and assume /3i+i(go) = xn~ 1 / 2 . The proof for 
this case is similar to the previous case and so we only outline it. It is easy to verify that if g$ 
doesn't survive in Tl then go doesn't survive in Tqq. Hence ^ Now, if ^7o doesn't survive 

in Too then either the root of Tl doesn't survive, or else, there is a relevant leaf in Tl- One can 
now show using the same argument as above that the expected number of relevant leaves in Tl is 
at most n _36e . This completes the proof. 

7.3 Proof of Lemma 17.21 (iii) 

The following implies Lemma 17.21 (iii) . 

Proposition 7.4. Let x € [0, 8], < I < L. Let g be a node at height 21 inTJ L . Then 

Pg,l( x ) = Pli x )( l ± 3r (»)700)- 

Proof. The proof is by induction on /. The assertion holds for the base case since by definition, 
Pg,o( x ) = Po( x ) = 1 f° r an x ^ [0> <5]" Let 1 < I < L and assume that the proposition holds for I — 1. 
Fix x G [0, 5] and let g be a node at height 21 in T? L . 

For brevity, define rj := T(i)j(i). Further, let 

/ Pl_l(x)(l - 377)^2m$(i5)0(i5)(l-l.O2r(i)) 

Lj : — ll 



m 

i Fi_l(x) 2 (l - 3r]) 2 \rn 2 HiSni-l-02m) 

m 2 
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and 

/ P W (i:)(l + 3 ?7 )^2m*(ii)^(i5)(l+1.02r(i)) 



m 

^ P_i(x) 2 (l + 3?7) 2 \™ 2 0(^) 2 (i+i-O2r(i)) 
m 2 / 

Let g' be a grandchild of g. By the induction hypothesis and by definition of P g ij-i(x) and P/_i(x), 

Pfl/,i-i(x) = Pj_i(x)(l±3r7). 

Thus, it follows from the definition of survival and by ([6]) and ([7]) that 

Q* < P g ,i(x) < Q* ■ 

It remains to bound Q* and Q*. In what follows we use the fact that 

Vz>l. exp(-l/(z - 1)) < 1 - l/z < exp(-l/z). (10) 

To bound Q*, we have 

/ fj_i(ar)(l-377)\2*n*(«)*(«) / Pj_i(x)\2m*(i<M^)(l-0(rj))(l-l/m) 



???. 



< 1 









m 


P- 


.i(x) 




m 


P- 





2m$(i<5U(i<5) / A \ -0(m<J>(i<5U(i<5)»V) 

< M_Z^-j I"" 

V m / V m/ 

< exp ( - 2fJ_ 1 (x)*(i*)^(i*)) • (1 + o( V )), 

where the first inequality follows from (|10p ; the second inequality follows from the fact that 1 /m = 
o(ji), which in turn follows from the definition of m and from Fact 14. 2 1 the third inequality follows 
since p_i(x) < x < 6; and the last inequality follows from (|10p and Fact 14.21 For similar reasons 
we also have that 

l iy 1 (a) 2 (l-3T7) 2 \m i W < / 
m? J ~ V 

< (1 



Pi 


-iW 2 




m 2 


Pi- 


-i(*) 2 




m 2 


Pi- 


-i(x) 2 


m 2 



)2 N m 2 Mi5) 2 / p N -0(m 2 (/.(i<5) 2 r)) 
< (1- - „ ) ' 



< exp {-Pi^{xf<t>{i5f) -(1 + 0(7?)). 
In addition, since P/_i(x)(l — 2>rf) < x < 5, and by definition of we have 

P l(x)(l - 37?) \ -2m*(i<5)*(i<5)-1.02r(t) / ^ \ -2m$(i«5)0(i,5)-l.O2r(i) 

j u A /M < /j < 1 + 2.05??, 

m / V m/ 

/ P_i(x) 2 (l-3r7) 2 \-^ 2 ^) 2 -i-02r(i) / ^ N-wV(M) a .i.Qar(i) 

(1- ' U , — J < 1 ») < 1 + I.O377. 

V 777/ / V m z / 

Then by the fact that 

Pl (x) = exp ( - P^x) 2 ^) 2 - 2P_ 1 (x) 2 $(i(5)0(^)), 
we can conclude that 

Q*< K (^)(l + 3r/). 
The argument for the lower bound on is similar. 
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7.4 When 2m&(i5)(fi(i5) isn't an integer 

We have defined the tree so that for every node g at even distance from the root, the number 
of children of g that are sets of size 1 is exactly |~2m#(i<J)0(£<5)~|. We further made the simpli- 
fying assumption that 2m$>(i5)(f>(i5) is an integer. Reviewing our proof above, we needed this 
simplifying assumption in order to get a relatively simple solution to the differential equation in 
Section 17,11 Here we briefly explain how one can modify the proof above so as to handle the case 
where 2m<&(i6)(f)(i6) is not an integer. 

The first step would be to take a random subtree of TJ L . Let C, 6 [0.1,0.9] be such that 
£ • 2m&(i5)(f)(i6) is an integer. Keep every subtree of TJ L that is rooted at a set of size 1 with 
probability £. This gives us a random subtree oiTJ L . Prom now on we only care about this random 
subtree and so for brevity, we refer to this subtree by Tjf L . Using the fact that £* holds, one can 
show the following. With probability 1 — n~ w( - 1 \ for every non-leaf node g at even distance from 
the root of TJ L , the number of children of g that are sets of size 1 is £• 2m$(i8)(j)(i5)(l =t 1.02r(i)) 
and the number of children of g that are sets of size 2 is as given in ([7]). Given that, we change the 
definition of accordingly by asserting that for every node g at even distance from the root of 
Too, the number of children of g that are sets of size 1 is £ • 2m$(i6)(f)(i6). 

Having redefined the above trees, the second step is to redefine the distribution of the birthtimes 
of the edges in L and T^. The birthtime of an edge that appears in a set of size 1 in Tj? L or in 
is redefined so that it is distributed uniformly at random in [0, £ • mn -1 / 2 ], whereas the birthtime 
of an edge that appears in a set of size 2 in TJ L or in remains uniformly distributed at random 
in [0,mra _1//2 ] as before. 

The rest of the proof is straightforward. In particular, the statement of Lemma 17.21 is not 
changed. The only necessary other modifications are the obvious ones that follow from the above 
changes in the definition of L and and the definition of the birthtimes of the edges that 
appear in those trees. 
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A Proof of Fact D 

Recall that -^erfi(<I>(x)) = x, where erfi(x) is the imaginary error function, given by, for example, 

■> _ _ 

that as x — > oo, <£(x) — > \/\nx and 4>{x) — ► {2xy/\nx)~ l . 



erfi(x) = -J= YIJLq j\[2j+i) ■ ^ e have that erfi(x) — > exp(x 2 ) / (y/irx) as x — > oo. Hence, it follows 



(i) We first upper bound 4>{i5) and $(i<5). We have that erfi(x) > if and only if x > 0. By the 
fact that ^erfi($(x)) = x we have erfi($(z5)) = 2z5/ v / 7r > 0. Hence Q(i5) > 0. Therefore 
4>(iS) = exp(— <3?(i<5) 2 ) < 1. Next, note that erfi(x) is monotonically increasing with x. We 
also have by ^erfi($(x)) = x that erfi(<5(z5)) is monotonically increasing with i. Hence 
&(i5) is monotonically increasing with i and so <&(i5) < <&(I5). The upper bound on &(i5) 
now follows since IS ~ n e and so &(I5) ~ \/ln n e . 

Next, we lower bound 4>(i5) and <£(i<5) (for i > 1). Since &(i5) is monotonically increasing 
with i, we have that (j)(i5) is monotonically decreasing with i. Therefore, it remains to 
show that <p(I5) = 0(5 L5 ) and $(<5) = Q(5). The fact that <j>{I§) = ft(5 L5 ) follows since 
4>(I5) -» l/(2I5VlnI5). The fact that $(<5) = £1(5) follows directly from the fact that 
-^erfi(<I>(x)) = x and the definition of erfi(x). 

(ii) By (i) we have 5<f>(i6)<t>(i6) < 5lnn = o(l) and 5 2 0(i5) 2 < 5 2 = o(l). Hence 7(1) = o(l). It 
also follows directly from the definition of and from the previous item that 7(2) = Q(5 5 ). 

We now bound T(i). Since T(i) is monotonically non-decreasing and T(0) = n _30e , it is enough 
to show that T(J) < n~ 10e . We do that by first showing that r(<5 _1 [hilnnj) < n -30e+o(l)_ 
For brevity, we shall assume below that [In In nj = In In n. 

For every < i < £ _1 lnlnn, $(i<5) < In Inn (crudely) and <j){iS) < 1. Therefore, we have 
that for every < i < <5 _1 In Inn, 

SG>(i5)<j)(i5) < 5 In Inn, and 
5 2 (j)(i5) 2 < 5 In Inn. 

Hence, for < i < In Inn, 7(1) < 5 In Inn and so 

r^lnlnn) < n- 30£ (l + 10<51nlnn)^ llnlnn = n - 30£+o ^ . 
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Now, note that for every S 1 In In n < i < I, 

6&(i6)<t>(i8) < 0.6/i, and 
S 2 (f>(i5) 2 < 0.6/i, 

and this follows from the fact that for <5 _1 lnlnn < i < I, $(i5)4>(i5) ~ l/(2id) and 4>{i5) < 
l/(2iS). Hence, for 5 _1 lnlnn < i < I, < 0.6/i and so we conclude that 

r(7) < n ~ 30£+o ^ Yl (1 + 6/i) < n - 30£+o(1) • exp(71n/) < n" 10 ". 

l<i<I 



26 



